# High-precision scoring
Fsfairx LLaMA3 RM V0.1
A reward model trained from Meta-Llama-3-8B-Instruct for use in RLHF pipelines, supporting PPO, iterative SFT, and iterative DPO.
Large Language Model
Transformers

sfairXC
4,157
56
Cross Encoder Umberto Stsb
A cross-encoder model for Italian sentence similarity, based on the UmBERTo architecture.
Text Embedding
Transformers Other

efederici
34
0
Featured Recommended AI Models